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Abstract 

We show new limits on the efficiency of using current techniques to make exact probabilistic inference 
for large classes of natural problems. In particular we show new lower bounds on knowledge compilation 
to SDD and DNNF forms. We give strong lower bounds on the complexity of SDD representations 
by relating SDD size to best-partition communication complexity. We use this relationship to prove 
exponential lower bounds on the SDD size for representing a large class of problems that occur naturally 
as queries over probabilistic databases. A consequence is that for representing unions of conjunctive 
queries, SDDs are not qualitatively more concise than OBDDs. We also derive simple examples for 
which SDDs must be exponentially less concise than FBDDs. Finally, we derive exponential lower 
bounds on the sizes of DNNF representations using a new quasipolynomial simulation of DNNFs by 
nondeterministic FBDDs. 


1 Introduction 

Weighted model counting is a fundamental problem in probabilistic inference that captures the computation 
of probabilities of complex predicates over independent random events (Boolean variables). Although the 
problem is #P-hard in general, there are a number of practical algorithms for model counting based on 
DPLL algorithms and on knowledge compilation techniques. The knowledge compilation approach, though 
more space intensive, can be much more convenient since it builds a representation for an input predicate 
independent of its weights that allows the count to evaluated easily given a particular choice of weights; 
that representation also can be re-used to analyze more complicated predicates. Moreover, with only a 
constant-factor increase in time, the methods using DPLL algorithms can be easily extended to be knowledge 
compilation algorithms |Huang and Darwiche, 2007] , (See |Gomes et al., 2009] for a survey.) 

The representation to be used for knowledge compilation is an important key to the utility of these 
methods in practice; the best methods are based on restricted classes of circuits and on decision diagrams. 
All of the ones considered to date can be seen as natural sub-classes of the class of Decomposable Negation 
Normal Form (DNNF) formulas/circuits introduced in |Darwiche, 2001 j , though it is not known how to 
do model counting efficiently for the full class of DNNF formulas/circuits. One sub-class for which model 
counting is efficient given the representation is that of d-DNNF formulas, though there is no efficient algorithm 
known to recognize whether a DNNF formula is d-DNNF. 

A special case of d-DNNF formulas (with a minor change of syntax) that is easy to recognize is that of 
decision-DNNF formulas. This class of representations captures all of the practical model counting algo¬ 
rithms discussed in Gomes et al., 2009 including those based on DPLL algorithms. Decision-DNNFs include 
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1 











Ordered Binary Decision Diagrams (OBDDs), which are canonical and have been highly effective representa¬ 
tions for verification, and also Free BDDs (FBDDs), which are also known as read-once branching programs. 
Using a quasi-polynomial simulation of decision-DNNFs by FBDDs, |Beame et al., 2013l|Beame et al.,~2 014 
showed that the best decision-DNNF representations must be exponential even for many very simple 2-DNF 
predicates that arise in probabilistic databases. 

Recently, |Darwiche, 2011| introduced another subclass of d-DNNF formulas called Sentential Decision 
Diagrams (SDDs). This class is strictly more general than OBDDs and (in its basic form) is similarly 
canonical. (OBDDs use a fixed ordering of variables, while SDDs use a fixed binary tree of variables, 
known as a vtree.) There has been substantial development and growing application of SDDs to knowledge 
representation problems, including a recently released SDD software package SDD, 2014|. Indeed, SDDs hold 


potential to be more concise than OBDDs. [Van den Broeck and Darwiche, 2015 showed that compressing 


an SDD with a fixed vtree so that it is canonical can lead to an exponential blow-up in size, but much 
regarding the complexity of SDD representations has remained open. 

In this paper we show the limitations both of general DNNFs and especially of SDDs. We show that the 
simulation of decision-DNNFs by FBDDs from [ Beame et al., 2013] can be extended to yield a simulation of 
general DNNFs by OR-FBDDs, the nondeterministic extension of FBDDs, from which we can derive expo¬ 
nential lower bounds for DNNF representations of some simple functions. This latter simulation, as well as 
that of |Beame et al., 2013] , is tight, since |Razgon, 2015a| (see also |Razgon, 2014| ) shows a quasipolynomial 
separation between DNNF and OR-FBDD size using parameterized complexity. 

For SDDs we obtain much stronger results. In particular, we relate the SDD size required to represent 
predicate / to the ’’best-case partition” communication complexity |Kushilevitz and Nisan, 1997] of /. Using 
this, together with reductions to the communication complexity of disjointness (set intersection), we derive 
the following results: 

(1) There are simple predicates given by 2-DNF formulas for which FBDD size is polynomial but for which 
SDD size must be exponential. 

(2) For a natural, widely-studied class of database queries known as Unions of Conjunctive Queries (UCQ), 
the SDD size is linear iff the OBDD size is linear and is exponential otherwise (which corresponds to a query 
that contains an inversion |Jha and Suciu, 2013| ). 

(3) Similar lower bounds apply to the dual of UCQ, which consists of universal, positive queries. 

To prove our SDD results, we show that for any predicate / given by an SDD of size S, using its associated 
vtree we can partition the variables of / between two players, Alice and Bob, in a nearly balanced way so 
that they only need to send log 2 S bits of communication to compute /. The characterization goes through 
an intermediate step involving unambiguous communication protocols and a clever deterministic simulation 
of such protocols from [Yannakakis, 1991]. 


Related work: The 

also shown independently in 
[Beame et al., 20131 |Beame et a 


quasi-polynomial 

Razg on, 2015b| . 


2014 


of [Pipatsrisawat and Darwiche, 2010 


on 


simulation of DNNFs by OR-FBDDs that we give was 
Beyond the lower bounds for decision-DNNFs in 
which give related analyses for decision-DNNFs, the work 
structured DNNFs is particularly relevant to this papeiQ 
Pipatsrisawat and Darwiche, 2010 show how sizes of what they term (deterministic) X- decompositions can 
yield lower bounds on the sizes of structured (deterministic) DNNFs, which include SDDs as a special case. 
|Pipatsrisawat, 2010| contains the full details of how this can be applied to prove lower bounds for specific 
predicates. These bounds are actually equivalent to lower bounds exponential in the best-partition non¬ 
deterministic (respectively, unambiguous) communication complexity of the given predicates. Our paper 
derives this lower bound for SDDs directly but, more importantly, provides the connection to best-partition 
deterministic communication complexity, which allows us to have a much wider range of application; this 
strengthening is necessary for our applications. Finally, we note that | Razgon, 2014| showed that SDDs 
can be powerful by finding examples where OBDDs using any order are quasipolynomially less concise than 
SDDs. 


1 We thank the conference reviewers for bringing this work to our attention. 
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Roadmap: We give the background and some formal definitions including some generalization required 
for this work in Section [2] We prove our characterization of SDDs in terms of best-partition communication 
complexity in Section [3]and derive the resulting bounds for SDDs for natural predicates in Section |4j We 
describe the simulation of DNNFs by OR-FBDDs, and its consequences, in Section [5j 

2 Background and Definitions 

We first give some basic definitions of DNNFs and decision diagrams. 

Definition 2.1. A Negation Normal Form (NNF) circuit is a Boolean circuit with -> gates, which may only 
be applied to inputs, and V and A gates. Further, it is Decomposable (DNNF) iff the children of each A 
gate are reachable from disjoint sets of input variables. (Following convention, we call this circuit a “DNNF 
formula”, though it is not a Boolean formula in the usual sense of circuit complexity.) A DNNF formula 
is deterministic (d-DNNF) iff the functions computed at the children of each V gate are not simultaneously 
satisfiable. 

Definition 2.2. A Free Binary Decision Diagram (FBDD) is a directed acyclic graph with a single source 
(the root) and two specified sink nodes, one labeled 0 and the other 1. Every non-sink node is labeled by a 
Boolean variable and has two out-edges, one labeled 0 and the other 1. No path from the root to either sink 
is labeled by the same variable more than once. It is an OBDD if the order of variable labels is the same on 
every path. The Boolean function computed by an FBDD is 1 on input a iff there is a path from the root to 
the sink labeled 1 so that for every node label Xi on the path, is the label of the out-edge taken by the path. 
An OR-FBDD is an FBDD augmented with additional nodes of arbitrary fan-out labeled V. The function 
value for the OR-FBDD follows the same definition as for FBDDs; the V -nodes simply make more than one 
path possible for a given input. (See f Wegener, 2000 | /.j 

We now define sentential decision diagrams as well as a small generalization that we will find useful. 

Definition 2.3. For a set X, let T : {0,1} X —» {0,1} and _L : {0,1} X —5> {0,1} denote the constant 1 
function and constant 0 function, respectively. 

Definition 2.4. We say that a set of Boolean functions {pi,P 2 , ■ ■ ■ ,Pe}, where each pi has domain {0,1} X , 
is disjoint if for each i ^ j, pi/\pj = _L. We call {pi,P 2 , ■ ■ ■ ,Pe} a partition if it is disjoint and \J i=1 Pi = T. 

Definition 2.5. A vtree for variables X is a full binary tree whose leaves are in one-to-one correspondence 
with the variables in X. 

We define Sentential Decision Diagrams (SDDs) together with the Boolean functions they represent and 
use (.) to denote the mapping from SDDs into Boolean functions. (This notation is extended to sets of 
SDDs yielding sets of Boolean functions.) At the same time, we also define a directed acyclic graph (DAG) 
representation of the SDD. 

Definition 2.6. a is an SDD that respects vtree v rooted at v iff: 

• a = T ora = l. 

Semantics: (T) = T and (_L) = _L. 

G(a) consists of a single leaf node labeled with (a). 

• a = X or a = ~>X and v is a leaf with variable X. 

Semantics: (X) = X and {~>X) = —^X 

G(a) consists of a single leaf node labeled with (a). 

• a = {(pi, Si),..., (pe, s*)}, v is an internal vertex with children Vr and vr, pi,... ,pe are SDDs that re¬ 
spect the subtree rooted at vl, si, ..., Se are SDDs that respect the subtree rooted at vr, and (pi), ■ ■ ■, (pe) 
is a partition. 
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Figure 1: An SDD with its associated vtree that computes the formula (iABAC)V (~>C A D) 
Semantics: (a) = \/" =1 (( Pi ) A (si)) 

G(a) has a circle node for a labeled v with £ child box nodes labeled by the pairs ( pt , s^). A box node la¬ 
beled ( pi , Si) has a left child that is the root of G(pi) and and a right child that is the root of G(si). The 
rest of G(a) is the (non-disjoint) union of graphs G(pi ),... ,G(pe) and G(s i),... ,G(se) with common 
sub-DAGs merged. (See Figure [7],) 

Each circle node a' in G(a) itself represents an SDD that respects a subtree of v rooted at some vertex v' 
ofv; We say that of is in a and use Sdds(V,a) to denote the collection of a' in a that respect the subtree 
rooted at v'. The size of an SDD a is the number of nodes in G(a). 

Circle nodes in G(a) may be interpreted as OR gates and paired box nodes may be interpreted as AND 
gates. In the rest of this paper, we will view SDDs as a class of Boolean circuit. The vtree property and 
partition property of SDDs together ensure that this resulting circuit is a d-DNNF. 

We define a small generalization of vtrees which will be useful for describing SDDs with respect to a 
partial assignment of variables. 

Definition 2.7. A pruned vtree for variables X is a full binary tree whose leaves are either marked stub or 
by a variable in X, and whose leaves marked by variables are in one-to-one correspondence with the variables 

in X. 

We generalize SDDs so that they can respect pruned vtrees. The definition is almost identical to that 
for regular SDDs so we only point out the differences. 

Definition 2.8. The definition of a pruned SDD a respecting a pruned vtree v, its semantics, and its graph 
G(a), are identical to those of an SDD except that 

• if the root vertex v of v is a stub then (a) must be A. or T, and 

• if the root vertex v ofv is internal then we only require that (p \),..., (pt) are disjoint but not necessarily 
that they form a partition. 

We now sketch a very brief overview of the communication complexity we will need. Many more details 
may be found in Kushilevitz and Nisan, I997| . Given a Boolean function / on {0,1} x x {0,1} Y , one can 
define two-party protocols in which two players, Alice, who receives x £ {0,1} x and Bob, who receives 
y £ {0,1 } Y exchange a sequence of messages mi,...,me = f{x,y) £ {0,1} to compute /. (After each 
bit, the player to send the next bit must be determined from previous messages.) The (deterministic) 
communication complexity of /, CC'(/(X, Y)), is the minimum value C over all protocols computing / such 
that all message sequences are of length at most C. The one-way deterministic communication complexity 
of /, CCx-s.y(/(X, Y)) is the minimum value of C over all protocols where Alice may send messages to 
Bob, but Bob cannot send messages to Alice. 

For nondeterministic protocols, Alice simply guesses a string based on her input x and sends the resulting 
message m to Bob, who uses m together with y to verify whether or not f(x,y) = I. The communication 
complexity in this case is the minimum |m| over all protocols. Such a protocol is unambiguous iff for each 
(x, y) pair such that f{x,y) = 1 there is precisely one message m that will cause Bob to output I. A set of 
the form Ax B for A C {0, f} x , B C {0,I} Y is called a rectangle. The minimum of \m\ over all unambiguous 



A BC f 
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protocols is the unambiguous communication complexity of /; it is known to be the logarithm base 2 of the 
minimum number of rectangles into which one can partition the set of inputs on which / is 1. 

A canonical hard problem for communication complexity is the two-party disjointness (set intersection) 
problem, V” = 1 x i A yi where x and y are indicator vectors of sets in [n]. It has deterministic communication 
complexity n + 1 (and requires fl(n) bits be sent even with randomness, but that is beyond what we need). 
We will need a variant of the “best partition” version of communication complexity in which the protocol 
includes a choice of the best split of input indices X and Y between Alice and Bob. 

A typical method for proving lower bounds on OBDD size for a Boolean function / begins by observing 
that a size s OBDD may be simulated by a log s-bit one-way communication protocol where Alice holds the 
first half of the variables read by the OBDD and Bob holds the second half. In this protocol, Alice starts at 
the root of the OBDD and follows the (unique) OBDD path determined by her half of the input until she 
reaches a node v querying a variable held by Bob. She then sends the identity of the node v to Bob, who 
can finish the computation starting from v. Thus, if we show that / has one-way communication complexity 
CCx->.y(/(X, Y)) at least C in the best split {X, Y} of its input variables, then any OBDD computing / 
must have at least 2 C nodes. 

Our lower bound for SDDs uses related ideas but in a more sophisticated way, and instead of providing a 
one-way deterministic protocol, we give an unambiguous protocol that simulates the SDD computation. In 
particular, the conversion to deterministic protocols requires two-way communication. 


3 SDDs and Best-Partition Communication Complexity 

In this section, we show how we can use any small SDD representing a function / to build an efficient 
communication protocol for / given an approximately balanced partition of input variables that is determined 
by its associated vtree. As a consequence, any function requiring large communication complexity under 
all such partitions requires large SDDs. To begin this analysis, we consider how an SDD simplifies under a 
partial assignment to its input variables. 

3.1 Pruning SDDs Using Restrictions 

Definition 3.1. Suppose that v is a pruned vtree for a set of variables X, and that v is a vertex in v. 
Let Vars(v) denote the set of variables that are descendants of v in v and Shell(r>) = X \ Vars(u). Also let 
Parent(t>) denote the (unique) vertex in v that has v as a child. 

We define a construction to capture what happens to an SDD under a partial assignment of its variables. 

Definition 3.2. Let a be an SDD that respects v, a vtree for the variables X, and suppose that a computes 
the function f. Let B C X and A = X \ B and let p : A —> {0,1} be an assignment to the variables in A. 
Let a\ p be Boolean circuit remaining after plugging the partial assignment p into the SDD a and making the 
following simplifications: 

1. If a gate computes a constant c£ {T, _L} under the partial assignment p, we can replace that gate and 
its outgoing edges with c. 

2. Remove any children of OR-gates that compute _L. 

3. Remove any nodes disconnected from the root. 

For each vtree vertex v £ v that was not removed in this process, we denote its counterpart in the pruned 
vtree v|a by v| a - 

Construct the pruned vtree v|a from v as follows: for each vertex v, if Vars(v) C A and Vars(Parent(u)) % 
A, replace v and its subtree by a stub. We say that we have pruned the subtree rooted at v. (See Figure 
for an example of an SDD and its vtree both before and after pruning.) 
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Figure 2: An SDD and its vtree, as well as the pruned pair after setting B to 0 and A, E, F to 1. 


For A C X, we call {A.X \ A} a shell partition for X if there is a vtree vertex v £ v such that 
Shell(w) = A. We call A the shell. If, for a restriction p : A —>■ {0,1}, there exists a vtree vertex v £ v such 
that Shell(u) = A, we call p a shell restriction. 


Proposition 3.3. Let a be an SDD that respects v, a vtree for the variables X, and suppose that a computes 
the function f. Let A C X and p : A —> {0,1} be a partial assignment of the variables in A. The pruned 
SDD a\ p has the following properties: 

(a) (a\ p } = f\ p . 

(b) a\ p is a pruned SDD respecting v|a- 

(c) G(a\ p ) is a subgraph ofG(a). 


Proof, (a): An SDD may be equivalently described as a Boolean circuit of alternating OR and AND gates. 
For any Boolean circuit in the variables X that computes /, plugging in the values for the restriction p yields 
a circuit computing f\ p . Furthermore, the simplification steps do not change the function computed. 

(b) : For each v such that Vars(w) C A and Vars(Parent(u)) % A, we have replaced the subtree rooted at 
v by a stub and replaced the SDDs in a respecting v by either T or 1. Thus a\ p respects v|a- 

We now check that a\ p is a pruned SDD. In particular we need to ensure that for each SDD a' = 
{(pi, Si ),..., (jpe, Sf)} in a , the corresponding pruned SDDs that remain from p\,... ,pt in its pruned coun¬ 
terpart a'\ p represent a collection of disjoint functions. From the first part of this proposition, these are 
{Pi^lp, • • ■, ( Pi k )\ p for some k < n, where we have only included those SDDs that are consistent under p. 
Since the original set of SDDs was a partition and thus disjoint, this set of restricted (pruned) SDDs is also 
disjoint. 

(c) : The process in Definition 3.2 only removes nodes from G(a) to construct G(a\ p ). Further, it does 

not change the label of any SDD that was not removed. □ 


3.2 Unambiguous Communication Protocol for SDDs 

The way that we will partition the input variables to an SDD between the parties Alice and Bob in the 
communication protocol will respect the structure of its associated vtree. The restrictions will correspond 
to assignments that reflect Alice’s knowledge of the input and will similarly respect that structure. 

Notice that a vtree cut along an edge ( u, v ) (where u is the parent of v) induces a shell partition for X 
consisting of the set B = Vars(u), and the shell A = X \ B. 
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Proposition 3.4. Let a be an SDD of size s computing a function f : {0,1} X —> {0,1} that respects a vtree 
v. Suppose that {A. B} is a shell partition for X and that A is its shell. Let b be the vertex in v for which 
Vars (b) = B and Vars (Parent ( 6 )) $2 B. 

For any shell restriction p : A —» {0,1}, the set (Sdds a | p ( 6 |A)) is a disjoint collection of functions. 

Proof. For non-shell restrictions p ', the collection of functions (Sdds a | p , ( v )) for a vtree node v is not disjoint; 
we need to use the specific properties of A and b. Since p was a shell restriction, the pruned vtree v|a takes 
the form of a path Ui|a 5 ■ • • ,Vk\A of internal vertices, where v\ is the root of v, and Vk\A = b |a, with the 
other child of each of vi|a, • ■ ■ >^fc-i|A being a stub, together with a vtree for the variables B rooted at b. 
We will show that if (Sdds a ,| p (u i |A)) is disjoint then so is (Sdds a | p (v i+ i |a))- This will prove the proposition 
since (Sdds a | p (r>i|A)) only contains the function (a| p ) and is therefore trivially disjoint. 

We will use the fact that every pruned-SDD from Sdds Q | p (ui + i|A) is contained in some SDD from 
Sdds Q | p (uj|A)- We have two cases to check: Ui + i|A is either a left child or a right child of u^a- 

If v.i + i|a was a right child then each pruned-SDD r)\ p contained in Sdds a | p (vi\ a) takes the form p\ p = 
{(T,s| p )}. Then (Sdds Q | p (t;.j + i|A)) = (Sdds„| p (ui|A)) and is therefore disjoint by assumption. 

Otherwise suppose that Ui+i|A is the left child of Vi\ a- Let p\ p £ Sdds Q | p (uj|A)- Let p\ p = 

{(? 7 i |p, T),..., (??fc|p),T)} where Vf=i ( 7 7i Ip) = (v\p) and {(Vi\ P ), ■ ■ • > (??fc|p)} , being a collection of primes for 
f?| p , is disjoint. By assumption (Sdds a | p (uj|A)) is disjoint, so for any other rf \ p = {(rj'i\ p , T),..., ( 7 ?{./|p,T)} £ 
Sdds Q | p (iij|A)} distinct from rj\ p , we have (r]\ p ) A ( 77 '| p ) = _L. Then for any i £ [fc] and j £ [&:'], we have 
(Vi\p) A Wj\p) = -L- Thus (Sdds«| p (ui+i| A )) is disjoint. □ 

Theorem 3.5. Let a be an SDD of size s that respects a vtree v and suppose that it computes the function 
f : {0,1} X —7- {0,1}. Suppose that {A,B} is a shell partition for X and that A is the shell. Let b be the 
vertex in v for which Vars( 6 ) = B and Vars(Parent( 6 )) % B. 

Consider the communication game where Alice has the variables A. Bob has the variables B, and they 
are trying to compute /(A,B). There is a log s-bit unambiguous communication protocol computing f. 


Proof. Suppose that Alice and Bob both know the SDD a. Let p : A — > {0,1} be the partial assignment 
corresponding to Alice’s input. This is a shell restriction. Alice may then privately construct the pruned SDD 
a| p , which computes f\ p by Proposition |T3| Further, a\ p evaluates to 1 under Bob’s input 0 : B — {0,1} if 


3.4 


and only if there exists a pruned-SDD rj\ p 
By Proposition 
Vars (b) = B = X 
Sdds Q | 

Sdds a | 


Sdds Q | p ( 6 | A ) such that (r?| p )(</>) = 1 . 

Also, since p is a shell restriction with shell A, 


(Sdds a | ( 6 |a)) is disjoint. Also, since p is a shell restriction with shell A, and 
\ A, every SDD in Sdds a | (&|a) was unchanged by p. In particular, this means that 
( 6 |a) C Sdds a ( 6 ) and any pruned-SDD p\ p can be viewed as some g £ SddSo,(&) that is also in 
( b Ia). 

For the protocol Alice nondeterministically selects an 77 from Sdds a | p ( 6 | a) and then sends its identity as 
a member of Sdds a ( 6 ) to Bob. This requires at most log s bits. Bob will output 1 on his input </> if and only 
if ( 77 ) (0) = 1, which he can test since he knows a and b. This protocol is unambiguous since the fact that 
(Sdds Q | p (6|A)) is disjoint means means that for any input <f> to Bob there is at most one 77 £ Sdds Q | p (6|A) 
such that ( 77 ) (<f>) = 1. Since Bob knows a, he also knows 77 and can therefore compute ( 77 ) Since a 
computes /, if ( 77 ) (cf) = 1 then /(</>, p) = 1. Otherwise all of the functions in (Sdds a | p (b \ A )) evaluate to 0 on 
input $ so f(cj), p) = 0. □ 


We can relate the deterministic and unambiguous communication complexities of a function using the 
following result from |Yannakakis, 199Tj . We include a proof of this result in the appendix for completeness. 

Theorem 3.6 (Yannakakis). If there is an g-bit unambiguous communication protocol for a function f : 
{ 0 , 1 } A x { 0 , 1 } B —> { 0 , 1 }, then there is a (g + 1 ) 2 -bit deterministic protocol for f. 

The following 1/3-2/3 lemma is standard. 

Lemma 3.7. For a vtree v for L variables, if a vertex b satisfies < |Vars( 6 )| < we call it a (1/3,2/3) 
vertex. Every vtree contains a (1/3, 2/3) vertex. 
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Definition 3.8. Let X be a set of variables and (A, B) a partition of X. We call the partition (A, B) a 
(5,1 — 5)-partition for S £ [0,1/2] if min(|A|, |B|) > <5|X|. That is, the minimum size of one side of the 
partition is at least a S-fraction of the total number of variables. 

The best (S, 1 — 5)-partition communication complexity of a Boolean function f : {0,1} X — > {0,1} is 
min(C'C(/(A, B))) where the minimum is taken over all (5,1 — S)-partitions (A, B). 

Theorem 3.9. If the best (1/3, 2/3) -partition communication complexity of a Boolean function f : {0,1} X —> 
{0,1} is C, then an SDD computing f has size at least T^~ x . 


Proof. Suppose that a is an SDD of size s respecting the vtree v for variables X, and that a computes /. 
From Lemma 3.7 the vtree v contains a (1/3, 2/3) vertex b. This (1/3, 2/3) vertex b induces a (1/3, 2/3)- 
partition of the variables {A,B} where B = Vars(6) and A = Shell(fr). Further, this partition {A.B} is a 
shell partition. By The orem |3.5| there exists a logs-bit unambiguous communication protocol for /(A.B). 
Then by Theorem 3.6 there exists a (log(s) + l) 2 -bit deterministic communication protocol for /(A.B). 


Since the best (1/3,2/3)-partition communication complexity of / is C, we have that C < (log(s) + l) 2 
which implies that s > 2^ _1 as stated. □ 


4 Lower Bounds for SDDs 


There are a large number of predicates / : {0,1}" —► {0,1} for which the (1/3, 2/3)-partition communication 


complexity is f2(n) and by Theorem 3.9 each of these requires SDD size The usual best-partition com¬ 


munication complexity is (1/2, l/2)-partition communication complexity. For example, the function Shift- 
EDEQ which takes as inputs x, y £ {0, 1}" and 2 £ {0, 1} r iog2 and tests whether or not y = SHIFT(x, z) 
where SHIFT(x, z) is the cyclic shift of x by (*)2 positions. However, as is typical of these functions, the 
same proof which shows that the (1/2, l/2)-partition communication complexity of ShiftedEQ is D(n) also 
shows that its (1/3, 2/3)-partition communication complexity is D(n). However, most of these functions are 
not typical of predicates to which one might want to apply weighted model counting. Instead we analyze 
SDDs for formulas derived from a natural class of database queries. We are able to characterize SDD size 
for these queries, proving exponential lower bounds for every such query that cannot already be represented 
in linear size by an OBDD. This includes an example of a query called Qv for which FBDDs are polynomial 
size but the best SDD requires exponential size. 


4.1 SDD Knowledge Compilation for Database Query Lineages 

We analyze SDDs for a natural class of database queries called the union of conjunctive queries (UCQ). This 
includes all queries given by the grammar 

q ::= i?(x) | 3xq \ q A q \ q V q 

where R(x) is an elementary relation and a; is a variable. For each such query q , given an input database 
D, the query’s lineage, , is a Boolean expression for q over Boolean variables that correspond to tuples 
in D. In general, one thinks of the query size as fixed and considers the complexity of query evaluation as 
a function of the size of the database. The following formulas are lineages (or parts thereof) of well-known 
queries that that are fundamental for probabilistic databases |Dalvi and Suciu, 2012[ |Jha and Suciu, 2013 









Jha and Suciu, 2013| ): 


over a particular database Dq (called the complete bipartite graph of size m in 

H 0 = V RiSijTj 

i,je[m\ 

Qv = \J RiSij V SijTj V RiTj 

i,je[m] 

Hi = V RiSij v S^Tj 

hie M 

Hfco = V bb for k > 1 

i£[m] 

H u = V S&S?/ 1 forO<£<fc 

Hfcfc = V • S 'o 7 '/ for fc > 1. 

2G[m] 


(The corresponding queries are represented using lower case letters ho,qv,hi, hko , ■ • •, hkk and involve unary 
relations R and T, as well as binary relations S and S k . For example, ho = 3xo3yoR(xo)S(xo,yo)T(yo)-) 
The following lemma will be useful in identifying subformulas of the above query lineages that can be used 
to compute the set disjointness function. 


Proposition 4.1. Let the elements of [m] x [to] be partitioned into two sets A and B, each of size at 
least 8m 2 . Let Row(z) denote {i} x [m] and Col(j) denote [m] x {/}. Define Mr ow = {i £ [m] | 0 ^ 
Row(i) n A and 0 ^ Row(i) n B}. That is, Row(«) for i £ TTr 0 w is split into two nonempty pieces by the 
partition. Similarly, define Wc oi = {* £ [to] | 0 Col(j) D A and 0 ^ Col(j) n B}. Then 

max(| Wrow|, |Wcoi|) > VS • to. 

Proof. Suppose that both |Wr ow | < to and |Wc 0 i| < to. By definition, if i ^ Wr ow then one of A or B 
contains an entire row, Row(*), say A without loss of generality. This implies that no column Col(j) is 
entirely contained in B. Since |Wcoi| < to, there is some column Col(j) that is entirely contained in A. 
This in turn implies that B does not contain any full row. In particular, we have that A contains all rows 
in [to] \ Wrqw and all columns in [to] \ Wc 0 i and thus B C Wr ow x Wool and so \B\ < |Wro W | • |Wc 0 i|- By 
assumption, \B\ > 8m 2 . Hence |Wr ow | • |Wc 0 i| > 8m 2 and so max{|wR OW |, |rt?coi|} > • to. □ 


Theorem 4.2. Form > 6, the best (1/3, 2/3) -partition communication complexity of Qv, Hq, and of Hi is 
at least to/3. 

Proof. Let X be the set of variables appearing in Qy (or Hi) and let (A, B) be a (1/3, 2/3)-partition of X. Let 
(A, B) be the partition of [to] x [to] induced by (A, B) and define Wr ow and VFcoi as in Proposition 4.1 Since 
|X| = to 2 +2m and only elements of [to] x [to] are relevant, |A|, \ B\ > (to 2 + 2m)/3 — 2 to = (1 — 4/to)to 2 /3 > 
to 2 /9 for to > 6 and hence max(|WR OW |, |Wc 0 l|) > to/ 3. We complete the proof by showing that computing 
Qy(A.B) and Hi (A, B) each require at least max(|WR OW |, |Wc 0 i|) bits of communication between Alice and 
Bob. We will do this by showing that for a particular subset of inputs, Qv is equivalent to the disjointness 
function for a max(|WR OW |, |Wc 0 i|) size set. 


Suppose without loss of generality that |Wr ow | > |Wc 0 i|- 


Set all Tj = 0 and for each i £ Wr ow set 
Ri = 0. For each i £ Wr ow for which Ri £ A, set all Sjj £ A to 0, let ji be minimal such that S l j i £ B, and 
set Sij £ B to 0 for all j > ji. (Such an index ji must exist since i £ Wr ow -) Similarly, For each i £ Wr ow 
for which Ri £ B, set all Sij £ B to 0, let ji be minimal such that S, l j i £ A, and set Sij £ A to 0 for all 
3 > Ji- In particular, under this partial assignment, we have 

Qv = Hi= \/ R i S iji 

^EWrow 


9 






and for each i £ Wr ow , Alice holds one of Ri or Sij i and Bob holds the other. We can reduce H 0 to the same 
quantity by setting all Tj = 1. This is precisely the set disjointness problem on two sets of size |Wr ow | where 
membership of i in each player’s set is determined by the value of the unset bit indexed by i that player 
holds. Therefore, computing Qy or H t requires at least Wr ow bits of communication, as desired. □ 


Combining this with Theorem |3.9| we immediately obtain the following: 

Theorem 4.3. For m > 6, any SDD representing Qy or Hi requires size at least 2V /m / 3 ~ 1 . 

As |Jha and Suciu, 2013| has shown that Qy has FBDD size 0(m 2 ), we obtain the following separation. 

Corollary 4.4. FBDDs can be exponentially more succinct than SDDs. In particular, Qy has FBDD size 
0{m 2 ) but every SDD for Qy requires size 2^/ m ^~ 1 for m > 6. 


We now consider the formulas H k i above. Though they seem somewhat specialized, these formulas are 
fundamental to UCQ queries: [Jha~ and S uciu, 2 0131 define the notion of an inversion in a UCQ query and 
use it to characterize the OBDD size of UCQ queries. In particular they show that if a query q is inversion- 
free then the OBDD size of its lineage Q is linear and if q has an minimum inversion length k > 1 then it 
requires OBDD size 2 n ( n / k ' > where n is the domain size of all attributes. Jha and Suciu obtain this lower 
bound by analyzing the Hki we defined above. (We will not define the notion of inversions, or their lengths, 
and instead use the definition as a black box. However, as an example, the query associated with H\ has an 
inversion of length 1 so its OBDD size is 2^ m \) 

Proposition 4.5. fJha and Suciu, 20131 Let q be a query with a length k > 1 inversion. Let Dq be the 
complete bipartite graph of size m. There exists a database D for q, along with variable restrictions pi for 
all i £ [0, k], such that \D\ = O(|D 0 |) and < F^ ) | p . = 'L^ 0 . = Hki 


Theorem 4.6. Let k > 2 and assume that m > 6. Let q be a query with a length k > 2 inversion. Then 
there exists a database D for which any SDD for Q = has size at least 2 '\/ m / fc / 3 - 1 . 

Proof. Given a query q , let D be the database fo r q constructed in Proposition |4.5| Fix the vtree v over 
Xfc respected by an SDD a for 


By Lemma 


3.7 


gives a (1/3, 2/3) partition {A,B} of X&. By Proposition 4.5 there are restrictions po 


d I 

q I Pi 


there exists a (1/3, 2/3) node b in the vtree v that 

, pk such that 


= H^ for all i. Thus a\ Pi is a (pruned) SDD, of size < that of a, respecting v| p . and computing H ki . 


<F 

Observe that the restriction of {A, B} to the variables of X^.j is also shell partition of v| p . at node b. 


We w ill show that there must exist an Hki for which CC(Hki(A, B)) > m/(9fc) and therefore by Theo¬ 


rem 


3.6 


3.5 


by Theorem 
Let Wchain 


this implies that the unambiguous communication complexity of Hki is at least | \jm/k — 1 Then 

any SDD respecting v that computes H k i has size at least 2^'/ m / k ~ 1 . 
contain all pairs (i,j) for which both A 0 (J^=i 7 ^ 0 an d L5 0 (j£=i {&ij} 7 ^ 0 and Let 
7 = 1/9. We will consider two cases: either |Wchain| > 7 • m or |Wchain| < 7 • m. 

In the first case, since |Wchain| > 7 -to, there must exist at least 7 -m tuples ( i,j , £) for which either <SA £ A 
and fif/ 1 £ B or vice-versa. Call the set of these tuples T. Then, since there are k — 1 choices of t < k, 
there exists some £* such that the set T^» := T D [m] x [m] x {£*} contains at least 7 • m/{k — 1) > m/(9k) 
elements. If we set all variables of X^. outside of T^» to 0, the function Hki* corresponds to solving a 
disjointness problem between Alice and Bob on the elements of T^*. Thus the communication complexity 
of Hki* under the partition {A.B} is at least m/(9k). 

In the second case, consider the largest square submatrix M of [m] x [m] that does not contain any 
member of Wchain- We mimic the argument of Theorem 4.2 on this submatrix M. By definition, M has 
side m! > (1 — 7 ) 771 . For every (i,j) in M, either A or B contains all <SA; let A be those (i,j) such that 
these are in A and B be those ( i,j ) for which they are in B. Since |A|, |B| > |Xfc|/3 = ( km 2 + 2m)/3 and 
there are at most 2m + (y 2 + 2y )km 2 variables not in M, 

|A|, \B\ > [{km 2 + 2m)/3 — 2m + (y 2 + 2y )km 2 ]/k 
= [(1 — y) 2 — 2/3 — 4/(3fcm)]m 2 > ( 777 ./I 8 ) 2 ) 
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since k > 2. Applying Proposition 4.1 we see that max(|WR OW |, | Wc 0 i|) > m/18 > m/(9fc). By the 
same argument presented in the proof of Theorem 4.2 we have both CC(i?fco(A, B)) > |Wr ow | and 
CC(iifcfc(A, B)) > |bPcoi| so at least one of these is at least m/(9k) and the theorem follows. □ 

It follows that for inversion-free UCQ queries, both SDD and OBDD sizes of any lineage are linear, 
while UCQ queries with inversions (of length k) have worse-case lineage size that is exponential (2 n ( m / fc ) 


for OBDDs and 2 n( V m / fc ) for SDDs). Note that the same SDD size lower bound for UCQ query lineage 
Q = <f>^ applies to its dual Q* = as follows: Flipping the signs on the variables in Q* yields a function 
equivalent to ~^Q. So flipping the variable signs at the leaves of an SDD for Q* we obtain an SDD of the 
same size for ~^Q and hence a deteministic protocol that also can compute Q. 


5 Simulating DNNFs by OR-FBDDs 

In this section, we extend the simulation of decision-DNNFs by FBDDs from |Beame et ah, 2 0l3| to obtain 
a simulation of general DNNFs by OR-FBDDs with at most a quasipolynomial increase in size. This 
simulation yields lower bounds on DNNF size from OR-FBDD lower bounds. This simulation is also tight, 
since |Razgon, 2015a| |Razgon, 2014| has shown a quasipolynomial separation between the sizes of DNNFs 
and OR-FBDDs. 

Definition 5.1. For each AND node u in a DNNFD, let M u be the number of AND nodes in the subgraph 
D u . We call u’s left child ui and its right child u r . We will assume M UI < M Ur (otherwise we swap ui and 
u r J. 

For each AND node u, we classify the edge ( u,ui) as a light edge and the edge (u,u r ) a heavy edge. We 
classify every other edge in D as a neutral edge. 

For a DNNF D or an OR-FBDD JF, we denote the functions that D and T compute as $£> and 

Constructing the OR-FBDD 

For a DNNF 2?, we will treat a leaf labeled by the variable A as a decision node that points to a 0-sink node 
if X = 0 and a 1-sink node if A = 1, and vice-versa for a leaf labeled by -A. We also assume that each 
AND node has just two children, which only affects the DNNF size by at most polynomially. 

Definition 5.2. Fix a DNNF D. For a node u in D and a path P from the root to u, let S(P) be the set of 
light edges along P and S(u) = {S(P) \ P is a path from the root to u}. 

We will construct an OR-FBDD T that computes the same boolean function as D. Its nodes are pairs 
(u, s) where u is a node in D and the set of light edges s belongs to S(u). Its root is (root(D ), 0). The edges 
in T are of three types: 

Type 1: For each light edge e = ( u , v ) in V and s £ S(u), add the edge ((u, s), (v , s U {e})) to T. 

Type 2: For each neutral edge e = (u,v) in V and s £ S(u), add the edge ((u, s), ( v,s )) to T. 

Type 3: For each heavy edge (u,v r ), let e = ( u,vi ) be its sibling light edge. For each s £ S(u) and 1-sink 
node w in D VI , add the edge (( w , s U {e}), ( v r , s )) to T. 

We label the nodes v! = ( u , s) as follows: (1) if u is a decision node in D for the variable X then v! is 
a decision node in J- testing the same variable X, (2) if u is an AND-node, then u' is a no-op node, (3) if 
u is an OR node it remains an OR node. (4) if u is a 0-sink node, then u' is a 0-sink node, (5) if u is a 
1-sink node, then: if s = 0 then u' is a 1-sink node, otherwise it is a no-op node. 

We show an example of this construction in Figure [3j 

Size and Correctness 

Lemma 5.3. For the DNNF D let L denote the maximum number of light edges from the root to a leaf, M 
the number of AND nodes and N the total number of nodes. Then T has at most NM L nodes. Further, this 
is N-2 1o ^ n . 
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Figure 3: A DNNF and our construction of an equivalent OR-FBDD. 

Proof. The nodes in T are labeled (it, s). There are N possible nodes u and at most M L choices for the set 
s, as each path to u has at most L light edges. 

Consider a root to leaf path with L light edges. As we traverse this path, every time we cross a light 
edge, we decrease the number of descendant AND nodes by more than half. Thus we must have begun 
with more than 2 L descendant AND nodes at the root so that N > M > 2 L . This implies that NM L is 
quasipolynomial in N, 

This upper bound is quasipolynomial in N, we will show that M > 2 L . Then, since N > M, NM L < 
jY2 lo s 2M < N2 log2 N . □ 


The proof of the following lemma is in the full paper. 

Lemma 5.4. F is a correct OR-FBDD with no-op nodes that computes the same function as D. 

Using the quasipolynomial simulation of DNNFs by OR-FBDDs, we obtain DNNF lower bounds from 
OR-FBDD lower bounds. 

Definition 5.5. Function PERM n takes an nxn boolean matrix M as input and outputs 1 if and only if M 
is a permutation matrix. The function ROW-COL„ takes an n x n boolean matrix M as input and outputs 
1 if and only if M has an all-0 row or an all-0 column. 

Theorem 5.6. Any OR-FBDD computing PERM„ or ROW-COL, must have size 2 n (") f Wegener, 2000 | /. 
Corollary 5.7. Any DNNF computing PERM n or ROW-COL has size at least 2 a ( v/ ") 


6 Discussion 


We have made the first significant progress in understanding the complexity of general DNNF representations. 
We have also provided a new connection between SDD representations and best-partition communication 
complexity. Best-partition communication complexity is a standard technique used to derive lower bounds 
on OBDD size, where it often yields asymptotically tight results. For communication lower bound C, the 
lower bound for OBDD size is 2 C and the lower bound we have shown for SDD size is 2 ^ — 1. This is a 
quasipolynomial difference and matches the quasipolynomial separation between OBDD and SDD size shown 
in |Razgon, 2014| . Is there always a quasipolynonrial simulation of SDDs by OBDDs in general, matching 
the quasipolynomial simulation of decision-DNNFs by FBDDs? Our separation result shows an example for 
which SDDs are sometimes exponentially less concise than FBDDs, and hence decision-DNNFs also. Are 
SDDs ever more concise than decision-DNNFs? 

By plugging in the arguments of (Pipatsrisawat and Darwiche, 2010 [Pipatsrisawat, 2010] in place of 
Theorem |3.5| all of our lower bounds immediately extend to size lower bounds for structured deterministic 
DNNFs (d-DNNFs), of which SDDs are a special case. It remains open whether structured d-DNNFs are 
strictly more concise than SDDs. |Pipatsrisawat and Darwiche, 2008] |Pipat srisawat ]~2010| have proved an 
exponential separation between structured d-DNNFs and OBDDs using the Indirect Storage Access (ISA) 
function Breitbart et al., 1995 , but the small structured d-DNNF for this function is very far from an SDD. 
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It is immediate that, under any variable partition, the ISA n function has an 0(logn)-bit two-round de¬ 
terministic communication protocol. On the other hand, efficient one-round (i.e., one-way) communication 
protocols yield small OBDDs so there are two possibilities if SDDs and structured d-DNNFs have different 
power. Either (1) communication complexity considerations on their own are not enough to derive a sepa¬ 
ration between SDDs and structured d-DNNFs, or (2) every SDD can be simulated by an efficient one-way 
communication protocol, in which case SDDs can be simulated efficiently by OBDDs (though the ordering 
cannot be the same as the natural traversal of the associated vtree, as shown by |Xue et al., 2012]). 
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A Proof of Theorem 


Let /(A,B) be a function with unambiguous communication complexity g and let Mf be its communication 
matrix. Then there exists a set D of 2 9 disjoint monochromatic rectangles that cover the l’s of Mf. 

Let G be a graph whose nodes are the rectangles in D and which has an edge connecting two rectangles 
if they share some row of Mf. Then each row r of Mf corresponds to a clique K r containing the rectangles 
intersecting r. Similarly, every column c corresponds to an independent set I c containing the rectangles 
intersecting c. For each row r and column c, the corresponding entry Mf(r, c) is 1 if and only if K r n I c ^ 
0. Thus for proving the theorem, it suffices to give a g 2 deterministic protocol for solving the Clique vs 
Independent set problem on a graph G with 2 9 vertices. 

The protocol reduces the graph in each step. Suppose that Alice holds a clique K of an n vertex graph 
G and Bob holds an independent set I. In each round Alice sends a node u £ K that is adjacent to fewer 
than half the nodes of G, or if no such node exists, she notifies Bob. 

If Alice sent the node it, then Bob responds with whether (i) u £ I, in which case K n I ^ 0, or (ii) that 
u is not adjacent to any node of I, in which case K n I = 0. If neither (i) nor (ii) occur then the nodes not 
adjacent to u are removed from G as they cannot be in I\ and the protocol repeats. 

Otherwise, if every u £ K is adjacent to over half the nodes of G, Bob sends a node v £ / that is adjacent 
to at least half the nodes in G if such a v exists. In this case Alice tells Bob that (i) v £ K so that KnI yA 0, 
or (ii) v is adjacent to all nodes in K so that K n I = 0. Otherwise, Bob says he has no such v £ I and the 
nodes adjacent to v are removed from G and the protocol repeats. 

Each iteration of this protocol removes at least half the nodes so that there are at most g iterations. The 
communication per iteration is at most g + 1 (to either send one of 2 9 nodes or that no good node exists). 


B Proof of Lemma 


Lemma B.l. T is a correct OR-FBDD with no-op nodes. 

Proof. We need to show that T is acyclic and that every path reads a variable at most once. These two 
properties follow from the lemma: 

Lemma B.2. If u is a leaf node in T> labeled by the variable X and there exists a non-trivial path (with at 
least one edge) between the nodes ( u, s), ( v, s') in T, then the variable X does not occur in V v . 


5.4 


3.6 


http://reasoning.cs.ucla.edu/sdd/. 
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This lemma implies that F is acyclic: a cycle in F implies a non-trivial path from some node (u, s ) to 
itself, but A' G T> u . It also implies that every path in F is read-once: if a path tests a variable X twice, first 
at (u,s) and again at (ui,si), then X G D Ul contradicting the claim. 

To prove the lemma, suppose to the contrary that there exists an OR-FBDD node (it, s) such that it is a 
leaf labeled with X and that there exists a path from (it, s ) to (v, s') in F such that X occurs in T> v . Choose 
v such that V is maximal; i.e. there is no path from (it, s ) to some (v'. s") such that T> v C T> v i and X occurs 
in V v '. Consider the last edge on the path from (it, s) to (it, s') in F: 

(it, s ), •••> ( w , s ")i {v, s'). 


Observe that (w, v) is not an edge in T> since T> v is maximal, and (it, v) is not an edge in T> since it was a 
leaf. Therefore the edge from (w,s") to (v,s') is Type 3. So V has an AND-node z with children vi,v and 
the last path edge is of the form (w, s' U {e}), (u, s') where e = (z, v{) is the light edge of z. We claim that 
e ^ s, so that it is not present at the beginning of the path. If e G s then, since s G S(u), we have it, which 
queries A, in T> Vl . Together with the assumption that some node in T> v queries X, we see that descendants 
of the two children vi,v of AND-node z query the same variable which contradicts that V is a DNNF. On 
the other hand, e G s". Now, the first node on the path where e was introduced must have an edge of the 
form (z, si), (i >i, s\ U {e}). But now we have a path from (u, s) to (z, si) with X G T> z D V v , contradicting 
the maximality of v. 0 . 


The next proposition says that on accepting paths P = {(iq, Si), (iq, S 2 ), ■ ■ ■ (iq, s^)} in the constructed 
OR-FBDD F , the sequence of sets (si,..., se) behaves like the sequence of states of a stack. We will use 


this characterization of paths in the proof of Lemma B.4 


Proposition B.3. Suppose that F has been constructed from a DNNF V and that P = 
{(iti, si), (it 2 , S 2 ),... (iq, Sf)} is a path in F consistent with a variable assignment 9. If, for j < i, we 
have ei G Sj, e\ G Sj, and e-i G s* \ Sj, then for no k > i do we have both e-i G s*, and e\ ^ s*,. 


Proof. Suppose the statement is false. Then there must exist a Type 3 edge ((it;, sU{ei, 62 }), (iv, sU{e 2 })) in 
the path P , where w G vi. However, we cannot have e 2 G S(v r ): e 2 was an edge in V Vl because (w, sU{ei, 62 }) 
was reachable in F meaning that ei G S(w). □ 


Lemma B.4. F computes the same function as T>. That is, = $•£>[#] for all variable assignments 9. 

Proof. Suppose that = 1. Then there exists a path P in F consistent with 9 that ends in a 1-sink 

node. 

If P = {(iii, Si), (112, S2), • • • (ue, s^)} has no Type 1 edges then it also has no Type 3 edges. Therefore 
(iq, • ■ •, ut) is a path of neutral edges in T> consistent with 9 to a 1-sink with no AND-nodes along the way, 
thus <Pt> [9\ = 1. 

Otherwise, let 

S — { (lli, Si) , (llz-f 1, S^-l-i ),■••, (ill -ej : Si-hj ) } 

be a sub-path of P. We say that S corresponds to an accepting sub-DAG rooted at 11 if there is a sub-DAG 
of V rooted at the node u whose OR nodes have fanout 1, AND nodes have full fanout, leaves are all 1-sinks 
under 9 , and whose edges are iq, iq+i, ■.., iq+j. 

Starting from an empty path in F, we will work backwards from the end of P, adding two possible kinds 
of sub-path: with all Type 2 edges removed, the first contains exactly one Type 1 edge followed by one or 
more Type 3 edges. The second kind, with all Type 2 edges removed, contains only one Type 1 edge and no 
Type 3 edges. It is possible to construct P using these two types of subpath by Proposition |B.3[ which says 
that 

^15 ^2 • • • 

is the sequence of states of a stack where, as we traverse the path P, its Type 1 edges push light edges while 
its Type 3 edges pop them. As P is an accepting path, we also have that St = 0. We will show that both 
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of these types of additions give a path corresponding to an accepting sub-DAG rooted at all AND nodes 
mentioned in the Type 1 edges of the sub-path. 

In the first case, say we add the sub-path Sh to the tail path St to form S = ShSt■ Suppose Sh contains 
the Type 1 edge ((uh, s),(vi,s U {e})), and that S t corresponds to an accepting sub-DAG respecting the first 
AND node in St, which we call u tl . We wish to show that S corresponds to an accepting sub-DAG rooted 
at Uh- Sh must contain a Type 3 edge popping e, hence there is a path in V from vi to a 1-sink that is 
consistent with 9. Therefore Sh corresponds to an accepting sub-DAG rooted at Vi (there are no AND-nodes 
along the way so the sub-DAG is the path). Further, since we can find a path of neutral edges in V from 
the sibling node of vi, v r , to u t (these come from the portion of P between the Type 3 edge popping e and 
the Type 1 edge {{uh, s), ( vi,s U {e}))), S corresponds to an accepting sub-DAG rooted at v r . Therefore, S 
corresponds to an accepting sub-DAG rooted at Uh- 

In the second case, we add a Type 1 edge (( Uh,s ), (vi,s U {e})) to the tail path St, which corresponds 
to an accepting sub-DAG rooted at u t , the first AND node in S t . Then u t must appear in V vi . Otherwise 
the first Type 1 edge in St comes after we pop e, but then St began with the Type 3 edge popping e. This 
cannot happen because of our inductive assumption that we add sub-paths that begin with a Type 1 edge. 
So St gives a path of neutral edges in V from vi to u t (this is the sub-path in between the added Type 1 
edge and the first Type 1 edge in St). Since S t corresponds to an accepting sub-DAG rooted at u t , it also 
corresponds to an accepting sub-DAG rooted at Vi . Similarly, St gives a neutral edge path in V from v r to 
the first AND node mentioned after popping e. Again, from our inductive hypothesis, St thus corresponds 
to an accepting sub-DAG rooted at v r . Therefore S corresponds to an accepting sub-DAG rooted at Uh- 

Now suppose d>x)[0] = 1. Then V has an accepting sub-DAG Vg respecting 6. We can find an accepting 
path in T from edges coming from T>g. This path will follow a left-to-right traversal of Vg, keeping track of 
light edges pushed and popped. The Type 1 and Type 2 edge portions of this traversal (moving left down 
the tree) directly translate to the appropriate edges in J. The necessary Type 3 edges for this traversal also 
exist in T since Vg only has 1-sinks. At the end of the traversal we will have popped all light edges and be 
at a 1-sink in Vg so we will be at a 1-sink for T. □ 
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